Speech Analysis Based on Modeling the Effective Voice Source

نویسندگان

  • M. Shahidur Rahman
  • Tetsuya Shimamura
چکیده

A new system identification based method has been proposed for accurate estimation of vocal tract parameters. An often encountered problem in using the conventional linear prediction analysis is due to the harmonic structure of the excitation source of voiced speech. This harmonic characteristic is coupled with the estimation of autoregressive (AR) coefficients that results in difficulties in estimating the vocal tract filter. This paper models the effective voice source from the residual obtained through the covariance analysis in the first-pass which is then used as input to the second-pass least-square analysis. A better source-filter separation is thus achieved. The formant frequencies and corresponding bandwidths obtained using the proposed method for synthetic vowels are found to be accurate up to a factor of more than three (in percent) compared to the conventional method. Since the source characteristic is taken into account, local variations due to the positioning of analysis window are reduced significantly. The validity of the proposed method is also examined by inspecting the spectra obtained from natural vowel sounds uttered by high-pitched female speaker. key words: glottal waveform, effective voice source, linear prediction, least square method, system identification

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing the Voice Handicap Index Scores in Groups with Structural and Functional Voice Disorders

Objective: The effects of voice disorders vary from person to person. Occupation, work environment, life, and family reaction are variables that affect one’s perception of his/her own as an impaired voice. Voice Handicap Index (VHI) has not yet been used to compare the degree of voice disorders. Assuming that the quality of life may be different under a variety of voice disorders and that diffe...

متن کامل

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

Investigating the formal effect of rear wall structure on acoustic parameters of speech halls (Research Article)

Referring to the rear wall in a hall is the furthest element rather than the voice source, therefor the reflections of this structural member play important role in music and speech intelligibly, especially for one-third behind audiences. Hence the form of these structures can be very effective in the acoustical quality of speech halls and auditoria. In this study, four formic structures are ex...

متن کامل

Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling

The paper presents a voice source waveform modeling techniques based on principal component analysis (PCA) and Gaussian mixture modeling (GMM). The voice source is obtained by inverse-filteirng speech with the estimated vocal tract filter. This decomposition is useful in speech analysis, synthesis, recognition and coding. Existing models of the voice source signal are based on function-fitting ...

متن کامل

Aperiodicity control in ARX-based speech analysis-synthesis method

We present an improved algorithm for a robust speech analysissynthesis method based on an auto-regressive with exogenous input (ARX) speech production model proposed previously. The speech analysis-synthesis method is capable of making an automatic estimation of vocal tract (formant) and voice source parameters from a speech utterance, generating accurate formant values even for very high-pitch...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEICE Transactions

دوره 89-D  شماره 

صفحات  -

تاریخ انتشار 2006